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CHAPTER 1 OUTLINE 


The »PD7755 /7756 is a speech synthesis LSI that employs 
the waveform coding method. By combining the ADPCM codiny 
method and phoneme method, natural synthesized speech has 
been realized. Also, the wide range of the operating volta- 
ges, compact packaye, standby function, etc., make these 
LSIs ideal for various application systems such as battery- 
driven systems. 


1.1 Features 


Synthesizing method *Combined ADPCM and phoneme methods 
Sampling frequency 24, 5, 6 or 8kHz 
Bit rate (speech) *8 to 32kbps 


Built-in speech data ROM :96K bits (wPD7755 ) 
256K bits (yPD7756 ) 

Synthesizing time 212 sec. max. (wPD7755 ) 
32 sec. max. (wPD7756 ) 

Built-in D/A converter :9-bit resolution 
current output 


Built-in standby function:Popcorn noise suppressor 


incorporated 
Supply voltage 82.7 to 5.5V 
Process : CMOS 
Package :18-pin plastic DIP 


X1 


X2 


Speech data 
10 


memory 


17 


REF 


D/A 


ADPCM 


System Controller AVO 


converte 


decoder 


Fig. 1-1 System Configuration 


CHAPTER 2 PIN FUNCTIONS 


2.1 Pin Configuration (Top View) 


2.2 Pin Functions 

(1) I0-I7 (Input) 
These eight pins together function as the message 
select input. Positive logic is used for these pins 
(high level=1). Each input is provided with an input 
latch that latches at the rising edge of the ST 
input. During standby mode, these pins should be set 
to either high or low level; If they are biased at 
or near the typical CMOS swith input,causing excess 
current drain. 

(2) CS (Input) 
This is the chip select input pin. When a low level 
signal is input to this pin, the ST input is enabled. 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


ST (Input) 

This is the start signal input pin. When a low level 
signal is input to this pin when the CS signal is 
also low level, speech synthesis of the messaye 
stored in the speech ROM location addressed by the 
contents of IO-I7 begins. 

When the CS and ST signals are both set to low level, 
the standby mode is released. 

BUSY (Output) 

This is an active low level pin that outputs the BUSY 
signal. A low level signal is output from this pin 
when the start signal is accepted; and once this 
output becomes low level, the start input will not 

be accepted. The output from this pin becomes high 
impedance in standby mode. 

REF (Input) 

This pin inputs the reference current for the D/A 
converter. The output current of the D/A current is 
controlled by the current input to this pin. This pin 
should be connected to Vpp via a resistor. In standby 
mode, this pin becomes high impedance, 


AVO (Output) 
This pin outputs the analogy speech signal, which is 


a unipolar, sink current. 


RESET (Input) 


This is the reset signal input pin and is used to 
initialize the LSI at power up, abort the speech 
synthesis, and release the standby mode. To reset the 
chip, this signal must be held low for at least 12 
oscillator clocks. When the standby mode is released, 
at least 12 clocks must be input after clock oscilla- 


tion completes. 


Xl, X2 
These are the clock pins and are connected to a cera- 
mic resonator (640kHz). During standby mode, the Xl 


becomes low level and the X2 becomes high level. 


Z=2 


(9) Vpp 

This pin should be connected to the power Supply. 
(10) GND 

This pin should be connected to yround. 


CHAPTER 3 OPERATION 


3.1 Sample Frequency 
The relation between synthesized speech messaye lenyth 
and bit rate when the ADPCM method is used is shown in 
Table 3-1. 


Table 3-1 Sampling Frequency and Maximum Message Length 


Sampling i Message 
frequency lenyth (sec.) 


uPD7755 PD7756 


For normal speech with no background music,about 10% to 20% of 
the message is made up of silent frames. These frames are 
compressed in the case of the w»PD7755 /7756 so the 

actual bit rate is about 80% to 90% of that shown in 

Table 3-1. 

Even further reductions in the bit rate have been achieved 

by combining the ADPCM and phoneme methods. The bit rates 

for speech synthesis when this combined method is used 


are shown in Table 3-2. 


Table 3-2 Sampling Frequency and Maximum Messaye Length 
(with data compression) 


Sampling Bit rate | Message 
frequency (kbps) length (sec.) 


PD7755 uPD7756 


3.2 D/A Converter 
The built-in D/A converter is a unipolar, current output 
type with a 9-bit resolution. The digital input to the 
converter is converted to a 9-bit offset binary code. 
This code has a range of 0 to 1FFH with a midranye value 
of 100H. 
The schematic drawing of the D/A converter is shown in 
Fig. 3-1. As can be seen, a constant-current source is 
connected in parallel for each bit of the digital input. 


#PD7755/7756 


| Tayo: Output current 
Ry: Load resistance 

AVO fi 
Vo: Potential across RL 


AVO 


verter 


FS 
9-bit digital input 


Fig. 3-1 Schematic Diagram of D/A Converter 


3.2.1 Output Current and Reference Resistance 
The output current of the D/A converter can be controlled 
by the voltage applied to the REF pin. This is realized 
by the resistor connected across the Vpp and REF pins as 
shown in Fig. 3-2. 
The relations between the voltage to the REF pin, VREF, 
and input reference current Ippp or D/A converter output 


current Iyao are shown in Figs. 3-3 and 3-4. 
If the impedance of an amplifier to be connected to the 


AVO pin is as high as several k& or more, refer to Fig. 


3-3 to determine the value of Ippp, In Fig. 3-3, the 


range Of Ipppy is 60 (at the intersection ot the load 
line, Rrer, and the MAX. curve) to 8UyA (at the intersec- 
tion of Rpprp and MIN.) when Rpygp is 50k and Vpp is SV. 
Under this condition, the ranye of Iayy is 1.9 to 2.9mA 
because Iyaop is 32 to 36 times that of Iper. Also, if the 
impedance is high, fluctuations in the output level of 
the synthesized speech are about 4dB, which is not so high. 
Therefore, a highly precise fixed resistor can be used as 
Rper instead of the variable resistor except for occa- 
sions when the synthesized speed output level must be 
constant. In contrast, if the impedance of the amplifier 
connected to the AVO pin is low, determine Rpgr by 
referring to Fig. 3-4. In this fiyure, the ranye of 

Tayo at a Vpp of 5V and a RrReF of 1k is 12 to 30mA. If 
the impedance is low, fluctuations in output levels of 
the synthesized speech are increased to about 8dB. 
Therefore, using a variable resistor such aS Rpg is 


recommended. 


Vpp Vpp 


#PD7755,#PD7756 


Fig. 3-2 D/A Converter Reference Current 
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Fig. 3-3 Relation Between Vppp and Iper/l avo 
(in Low-Current Area) 
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Fig. 3-4 Relation Between Vpgp and Ipggr/I ayo 


(in High-Current Area) 


3.2.2 Setting Output Current 


Because the output current from-the AVO pin (I jis a 


sink-load current, the load resistance (RL) iF cannenbad 
to Vpp. Therefore, the potential at AVO must be within 
the range at which the output transistor of the D/A con- 
verter will operate as a constant-current supply. 
The operating voltage range is: 

When Vpp=5v: 1.5V to Vpp 


When Vpp=3v:. lv to Vpp 
The load resistance and output current should be deter- 


mined so that the operating voltage applied to the output 

transistor falls within this range. 

To adjust output current Iayo, observe the following pro- 

cedure, 

(1) Select the messaye to be synthesized (to allow the 
D/A converter to output the middle(bias) current). 


(2) Input the RESET signal within 3 seconds after the 
synthesized speech has been output (to prevent the 
LSL from entering standby mode). 


(3) Adjust Rrer so that the voltaye applied to thelayo 
falls within the rated value. 


3.3 Standby Mode 
When the yPD7755 /7756 is not performing speech synthesis, 
it is set in standby mode. In this mode, current consump- 
tion is reduced to less than 1yA (TYP). The condition of 
the »PD7755 /7756 in standby mode is as follows: 
° The clock stops 
° The BUSY pin becomes high impedance 
° The REF input pin becomes high impedance 
° The Xl becomes low level and the X2 becomes high level. 
° 10-17, ST, CS, and RESET inputs are all enabled 
° The Tavo becomes 0. 
3.3.1 Entering standby mode 
The ywPD7755 /7756 enters the standby mode if the 
following condition exists for more than 3 seconds after 
completion of speech synthesis: 
(1) CS or ST is high level 
(2) RESET is high level 


3.3.2 Releasing standby mode 
Standby mode is released by the following procedure: 
(1) Set CS to low level 
(2) Set ST to low level 
At this time, the message select code will be input 
to I0-I7. 


3.3.3 Eliminating popcorn noise in,.standby mode 
Because the yPD7755 /7756 uses a unipolar 9-bit D/A 
converter in operation mode, there is a bias current 
output even when there is no signal input (AC signal). 
In standby mode, the input value to the D/A converter 


becomes 0 as does the output current. 


3-6 


Therefore, when the LSI moves from operatiny mode to 
standby mode, popcorn noise may be generated by the 
sudden changes in the output of the D/A converter. To 
prevent this, the output of the D/A converter is grad- 
ually reduced prior to entering the standby mode (see 
Fig. 3-5). 

Also, when chanying from standby to operating mode, the 
D/A output is gradually raised to AC 0 from the time when 
clock oscillation starts and completes in response to the 
standby release signal generated by input of the CS and 
ST signals (see Fig. 3-6). 

The transition time for the D/A output is approximately 


46.5ms and which enough to suppress this type of noise. 


D/A converter input 
1FFH 


AVO 100I1f 


000H 
D/A output 


Operating = transition time Standby mode 


t 
Start standby operation 
Fig. 3-5 D/A Converter Output when Entering Standby Mode 


D/A converter input 
1FFH 


AVO 100H 


000H 
Clock oscilla D/A output 


Standby sa tion time transition time peaahen mod 


Standby release signal 
(Start speech synthesis signal) 


Fig. 3-6 D/A Converter Output when Releasing Standby Mode 
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3.3.4 Pin settings in standby mode 

The following cautions should be observed in regard to 

the BUSY signal output and the message select code input 

(10-17) in standby mode, 

(1) BUSY pin 
Because the output from this pin becomes high imped- 
ance in standby mode in implementations where this 
signal is checked the external controller must be 
configured to insure that this siynal is pulled up. 

(2) I0-I7 pins 
If these pins are not set to high or low level in 
standby mode, there is achance that excess current will 
drain.These pins should therefore be set either to 
high level or low level during standby mode. If these 
pins are connected to a circuit (such as a bus) with 


a floating state, they must be pulled up or down. 


3.4 Start Speech Synthesis 
Speech synthesis begins when the signal input to the ST 
pin goes low when the signal at the CS is already low. 
Note that the operation performed when a pulse is applied 
to the ST pin is somewhat different from that when it is 


fixed at low level. 


3.4.1 ST pulse input 
The timing when the signal input to the ST pin is a pulse 
is shown in Fig. 3-8. The data on IO0-I7 (message select 
data) is latched at the rising edge of the signal input 
to the ST pin. After this data is latched, there will be 
no effect on the speech synthesis operation even if both 
the CS and the ST signals are set to high level. 
Since the speech synthesis start control circuit continues 
to operate during standby mode, the start procedure is the 


same in both standby and operating modes. 


3.4.2 ST fixed input 
When the ST input is fixed at low level, the speech 
synthesis operation is performed repeatedly (see Fiy. 3-9). 
In this state, the messaye select code (the contents of 
I0-I7) are not latched so this data should not be chanyed. 


Changing this data may result in misoperation. 


3.4.3 Standby mode 
Since the speech synthesis start control circuit con- 
tinues to operate during standby mode, the same procedure 
as that described in 3.4.1 can be used to start speech 
synthesis. In this case, the clock oscillation begins 
when both the ST and CS pins become low level and the 
BUSY signal is output after the clock oscillation has 
become stable. Then, as described in section 3.3, the 
speech output begins approximately 46.5ms later (See Fig. 
3-10). 


Table 3-2 Speech Synthesis Start Timing 
(Vop= +2,.7~5.5V, fosc 


Parameter 


= 640KHz) 


BUSY output tspo | Operation mode | | 6.25 fio | us | 


| tsBo_| 
time tsps | Standby mode * 
(includes 
” oscillation 
start time) 


tsso_|operation moae| | 2.1 | 2.2 | ms | 
start time tsss | Standby mode il 


D/A converter tDaA Standby mode 47 
transition time 


Note: Ceramic resonator Kyocera Corp. KBR-640B, Cl=C2=150 pF 


Xi 
_—| 
X2 
om | c2 
tr rh 


Fig. 3-7 Oscillator 


Speech output 
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3.4.4 At reset or power up 
If the RESET signal is input while a speech is beiny 
synthesized, or on power application, the data input to 
the D/A converter is undefined but within the limit of 0 
to 100H. 
If speech synthesis is started under this condition with 
both the ST and CS pins being low level, the input data 
of the D/A converter is shifted to 100H and then the 
speech is output. 
The input to the D/A converter is shifted yradually to 
100H in the same manner as described in section 3.3, the 
synthesized speech will be output a maximum of 47ms after 
the RESET signal has been input or the power has been 


applied. 


H 

cs - es pemgeees “oul 
L- 
H 


mw \ foe 
i. = ; oT pulse width 


' 
r— tcc 


i] 
| | Speech output start time 


Fig. 3-8 Speech Synthesis Start Input 
(When ST Input is Pulse) 
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Fig. 3-9 Speech Synthesis Start Input 
(When ST Input is Fixed Low Level) 
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Fig. 3-10 Speech Synthesis Start Operation 


(Standby Mode) 


3.5 Reset 
The RESET signal is used to initialize the LSI on power 
application; terminate the synthesized speech output, or 
release the standby mode. 
The »wPD7755 /7756 is reset by holdiny the RESET pin to 
low level for the following period of time. 


° During operation 12 oscillator clocks (min.) 
(18.75ys at 640kHz) 
° On power application The same as above after 
in standby mode clock oscillation is completed 


CHAPTER 4 INTERFACES 


4.1 Message Select Input 

4.1.1 Host control mode 
Fig. 4-1 shows an example when a yPD80C48 is used as the 
host CPU. The messaye select code is output to the data 
bus and this data is written to the ywPD7755 /7756 by 
setting the CS and ST signals to low. 
Because the output of the BUSY pin becomes hiyh impedance 
in standby mode, this signal must be pulled up. (In the 
example in Fig. 4-1, because the ports of the w»PD80C48 
are provided with built-in pull-up resistors, the pull-up 
resistor shown can be omitted.) Also, in standby mode, if 
the bus to which I0-I7 is connected becomes hiyh 
impedance, there isachance that excess current will drain.In 
this case, these signals should always be connected 
either to pull-up or pull-down resistors. Fig. 4-2 shows 
a flowchart when control is performed by the host CPU. The 
circuit that checks the BUSY output in standby mode must 
be provided with a wait equivalent to the clock oscilla- 


tion start time. 


4.1.2 Key input mode 
Fig. 4-3 shows an example of the wPD7755 /7756 used 
alone. If the ST input switch is fixed to ground, the 
speech synthesis output will repeat indefinitely. 
Unnecessary I0-I7 inputs should be connected to ground. 


HPD80C48 — HPD7755,#PD7756 
* WR ST * 


Fig. 4-1 Circuit Example when Control is by Host 


* Use 5V+10% for Vpp. When a 3V line is used, the 


minimum pulse width for the yPD7755 /7756 ‘ST input 
becomes 2s and it no lonyer possible to use the WR 


output of the »PD80C48. 


Start 


Output low to 
cs 


Set message 


select data 


in accumulator 


Output contents 
of accumulator 


to data bus 


; Check wPD7755 /7756 
BUSY output 


; WR is output with output 
to bus; WR functions as 


start signal 


Fig. 4-2 Flowchart (Control by Host) 
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Fig. 4-3 Key Input Mode Circuit Example 


4.2 Lowpass Filter 
Signals that have been digitally analyzed and synthesized 
by a sampling method can only reproduce frequency ranyes 
up to one half the sampling frequency. Frequencies above 
this range appear as unwanted noise and if the D/A output 
is directly amplified, the result would be a poor quality 
sound with an extremely unfavorable S/N ratio. It is 
therefore necessary to use a lowpass filter that will 
only allow signal frequencies of less than half the 
sampling frequency to pass. 
A filter with sharp bandpass characteristics is desirable 
and ideally a filter with damping characterics of 
48dB/oct or better should be used. The complexity of such 
a filter, however, is prohibitive and we have opted, in 
the interest of simplicity and operability, for a filter 
with characterisics of 24dB/oct for use in the »PD7755 / 
7756 . Fig. 4-4 shows a pPC358C analog filter with 
Butterworth damping characteristics (24dB/oct) that can 
operate on a +5V single power supply. 


Voc Veo 


5ka. 
#PD7755, 
#HPD7756 


#PC358C 


FILTER 


AVO OUT 


Fig. 4-4 Lowpass Filter Circuit Example 


When the yPC358C operates on +5V, input signals in a 
range of 0 to 3.5V may be used. Resistors Rl and R2 
should therefore be used to set the operating point to 
2..5V. 

Table 4-1 shows the appropriate constants when sampling 


frequencies (fsypr) of 4, 5, 6, and 8kHz are used. 


Table 4-1 Lowpass Filter Constants 
(24dB/oct Butterworth characteristics) 


el 0.022 uF 
C2 0.0022 uF 
C3 0.022 uF 
c4 0.01 uF 
RL 1 KQ 


4.3 Power Amp 
Because the signal obtained as the output of the lowpass 
filter described in the preceding paragraph has a wave- 
form identical to that of a normal analog signal, this 
output can be input to any type of power amp as required. 
Fig. 4-5 shows an example of a power amp (ywPC1212C) with 
a low operating voltage that can be used with the +5V 
single power supply of the »PD7755 /7756 . 


Fig. 4-5 Power Amp (0.7W, 5V) 


Because the yPC1212C can operate on voltayes in the range 
of 4.5V to 7.0V, if a 42 speaker is used with 5.0V power 
supply, a 0..7W (THD=10%) output is obtained. 


CHAPTER 5 SPEECH ANALYSIS 


The procedure for developing a ROM code for the yPD7755 / 
7756 is shown in the flowchart in Fig. 5-l. The main points 


of the procedure are described below. 


Development 
decision 


Message script 


z Message recording 


User 
Original speech 
evaluation, editing 
Parameter 
selection 
Analysis 
order 
Analysis/coding 
NEC 
NG 
Parameter ! 
adjustment | 
User 


Mask ROM order 


Fig. 5-1 ROM Code Development Flowchart 


5.1 Writing Message Script 
Because the fequency bandwidth of the reproduced speech 
is narrower than that of natural speech, care should be 
taken to write a script that avoids the use of ambiguous 
expressions. Also, if messages that share common phrases 


are used, ROM area can be used effectively. 


5.2 Original Speech Recording 
Because the effective dynamic range for synthesized 
speech is narrow, the speaker (announcer) should take 
care to suppress inflections and to speak in an level, 
even tone of voice. 
Because the synthesizing process sometimes emphasizes 
noise in the original voice recording, care should be 
taken to keep all noise (particularly hum) to the abso- 
lute minimum. For this reason, the use of a professional 
recording studio using open reel tapes is recommended. 
Also, to permit selection of the sample with the best 
speech quality and the least noise, you are asked to 


provide several recordings of each message. 


5.3 Original Speech Evaluation and Editing 
Selection should be made from among the recorded messages 
taking into consideration lack of noise, even tone of 
voice, sound quality, etc. The messages thus selected 
should be edited onto an open reel or cassette tape. When 
an open reel tape is used, either the full tape or two 
tracks may be used. For a good S/N ratio when using a 
cassette, the use of metal tape is recommended. The 
recording should be done at a somewhat high level with 
peak values are in the range of +5 to +8dB. To ease the 
analysis process, the format shown in Fig. 5-2 should be 
used with the messages appropriately divided by blanks. 


Tape direction Start of tape 


Messaye Message Blank Message Blank Message Blank 
#7 #2 (3 to 5 sec) F1 (3 to 5 sec) ¥0 (15 to 30 sec) 


Fig. 5-2 Original Speech Tape Format for Analysis 


When a cassette is used, only side A should be used. 


5.4 Parameter Selection 
The parameters to be used must be determined before the 
order for analysis is placed. For the yPD7755 /7756 , the 
only parameter that must be selected by the user is the 
sampling frequency. 
Sampling frequency selection 
Please specify 4, 5, 6, or 8kHz as the sampling frequency. 
When 6kHz is specified, the frequency bandwidth is almost 
identical to that of a telephone. When 4kHz is selected, 
the bandwidth drops to only those frequencies under 2kHz 
and there is some loss of clarity. Although no definitive 
statement can be made about selection of the sampling 


frequency, the guidelines laid out in Table 5-1 can be 


used, 


frequenc (bps) 
4kHz Male voice/ 8 to 16K 
sentences 


5kHz Male voice/ Slightly 10 to 20K 
single words; | distorted 
female voice/ 
sentences 


6kHz Female voice/ | Equivalent 12 to 24K 
single words to telephone 


conversation 


16 to 32K 


Note that a great deal of noise in the messaye will cause 
the bit rate to be high. 


5.5 Ordering Speech Analysis 
Place your order for analyis after making the prepara- 
tions described above. Be sure to include all of the 
following when placing your order: 
Original speech tape (with company name and date clearly 
indicated) 
Message list (message selection codes in the same order 
as messages are recorded on the tape) 
Parameter specification (sampling frequency) 
All tapes should include the company name, section, and 


name of person responsible. 


5.6 Evaluation 
A cassette tape with a recording of the synthesized 
speech (result of analysis and coding) will be returned 
to the client along with the ROM code on an 8-inch floppy 
disk, 
If the result of evaluation is satisfactory, order the 
mask ROM specifications. 
If there are any problems, contact NEC. 


NEC cannot assume any responsibility for any circuits shown or 
represent that they are free from patent infringement. 

NEC reserves the right to make changes any time without notice. 
© by NEC Electronics (Europe) GmbH 


sarope) G mbH, Oberrather Str. 4, D- 4000 Disseldorf 30, W.-Germany, Tel. (0211) 650301 Telex 858 996 
. -0 


rmmany) GmbH, Oberrather Str. 4, D-4000 Disseldorf 30, Tel. (0211) 650302, Tel 
3s/>O,D- <3000 Hannover 1, Tel. (0511) 88 1013-16, Telex 9230109 ere 
~ £8000 Munchen 81, Tel. (089) 4160020, Telex 522971 
344, D- 7000 Stuttgart 30, Tel. (07 11) 890910, Telex 7252220 


enelux),. Boschdijk 187 a, NL- 5612 HB Eindhoven, Tel. (040) 4458 45, Telex 51923 
sc andinavia) — Box 4039, S-18304 Taby, Tel. (08) 7567 245, Telex 13839 
raance) S- A.., Tour Chenonceaux, 204, Rond Pointdu Pontde Sévres, F-92516 Boulogne Billanco: 
4 urt, Tel. 
Aliana s-r.l., Via Cardano 3, I-20124 Milano, Tel. (02) 6709 108, Telex 315355 Tel. (01) 6 999004, Teles. 203 
3i<) Ltd., Block 3 Carfin Industrial Estate, Motherwell ML1 4UL, Scotland, Tel. (0698) 7322 24 Telex 7775 a 
’ 65 


